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A3STRACT 


This  thesis  applies  hierarchical  clustering  and 
quadratic  discriminant  function  techniques  to  the 
problem  of  predicting  2-4  test  passers  and  non-test 
passers  (including  non-test  takers)  in  the  Mavy.  The 
ouograpnic  data  base  includes  items  such  as  test 
scores  and  education  to  serve  as  separators  and 
predictors  in  the  techniques. 


The  clustering  of  rates  permitted  accumulation  of 
personnel  in  the  lightly  staffed  ratings  into  similar 
groups  of  substantial  size,  was  objective,  and  may  be 
useful  for  purposes  other  than  the  present  one.  The 
discriminant  analysis  produced  correct  classification 
races  of  aoout  60  to  70  percent  oased  on  the  data  at 
hand. 
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INTRODUCTION! 


In  these  days  of  increasing  costs,  the  Navy  oust  ensure 
it  is  recruiting  and  retaining  the  best  possible  personnel. 
Enlistees  should  not  only  have  the  capacity  to  do  the  work, 
but  also  the  desire  and  drive  to  accomplish  the  assigned 
tasks.  Currently,  there  are  few  measures  of  effectiveness 
which  indicate  how  an  enlistee  will  perform  in  the  Navy. 
The  SCREEN  score,  developed  by  Dr.  Robert  Lockaan  of  the 
Center  for  Naval  Analyses,  predicts  attrition  of  first  term 
recruits  as  a function  of  test  score  and  biographic  data 
111.  It  does  not,  however,  specifically  address  the 
performance  of  a member  while  in  the  Navy;  rather  it 
predicts  survival  in  the  Navy. 

The  advancement  to  2-4  is  a possible  measure  of 
effectiveness  with  which  a member's  utility  to  the  Navy 
could  be  judged.  Pirst,  the  member  must,  display  a certain 
amount  of  initiative  and  drive  to  prepare  for  the  exam. 
Second,  passing  the  test  is  an  indication  that  a member  has 
reached  a basic  level  of  knowledge  within  a specific  rating. 
It  is  at  this  point  that  the  Navy  begins  to  get  a return  on 
its  investment  in  the  member.  The  3-4  test,  therefore,  may 
be  used  as  a measure  of  both  a member's  capacity  and  desire 
to  perform  in  the  Navy. 

This  thesis  explores  the  differences  between  the  S-4 
test  passers  and  non-test  passers  based  on  the  individual's 
biograpnic  and  demographic  data.  one  would  like  to  know 
what  variables  in  a member's  background  are  associated  with 
his  ability  and  desire  to  pass  the  exam.  Specifically,  an 
ordered  list  of  determining  factors  is  desired.  Also,  it  is 
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important  to  learn  if  tie  determining  factors  are  identical 
for  all  ratings  or  if  they  vary  from  rating  to  rating. 
Finally,  it  would  be  interesting  to  discover  bow  length  of 
time  in  service  and  time  in  rate  are  related  to  a member's 
tendency  to  advance. 

For  the  purposes  of  this  analysis,  a test  passer  is 
defined  as  a member  who  has  taken  the  E-4  exam  and  received 
a score  which  caused  him  to  be  advanced  to  E-4.  a non-test 
passer  is  a member  who  has  either  taken  the  E-4  exam  and  did 
not  receive  a high  enough  score  to  be  advanced  or  who  did 
not  take  the  exam  even  though  he  was  eligible.  host  of  the 
non-test  passers  in  the  study  are  non-test  taxers.  The 
separation  of  the  non-test  takers  from  the  non-test  passers 
was  not  possible  due  to  the  lack  of  data.  The  reader  will 
need  to  keep  these  definitions  in  mind  while  reading  the 
text.  For  clarity,  a graphical  presentation  of  these 
definitions  may  be  found  in  Figure  1 in  Chapter  III. 

The  data  used  in  this  study  coma  from  all  personnel 
eligible  to  take  the  E-4  exam  in  August  1977.  A member  was 
considered  eligible  if  he  had  at  least  one  year  in  the  Navy 
and  at  least  six  months  is  an  2-3 . 

Two  major  results  were  obtained  from  this  stuiy.  First, 
tne  use  of  non-linear  discrimination  resulted  in  correct 
classification  of  60  - 70S  of  the  memoers  of  each  sample 
examined.  Second,  hierarchical  clustering  was  applied  to 
group  the  ratirgs  (especially  those  with  small  numbers  of 
memoers) . These  groupings  may  have  general  use  beyond  those 
of  this  paper. 
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II.  DISCUSSION  of  data 


There  are  many  possible  factors  which  may  affect  a 
member's  desire  and  aoility  to  pass  tna  2-4  exam.  Examples 
of  these  are  years  of  education,  job  satisfaction, 
leadership  of  superiors,  sea/shore  assignment,  etc.  Data 
for  some  of  the  aore  appealing  of  these  was  unavailable  for 
tne  study.  The  factors  used  in  this  analysis  were  all  drawn 
from  the  August  1977  monthly  Enlisted  Master  decord  ( 5MR) 
file  located  at  the  Naval  Military  Personnel  Center  in 
Arlington,  Virginia.  The  month  of  August  was  chosen  because 
it  coincided  with  a month  in  waich  the  E-4  exams  were  given. 

The  2MR  is  a 3000  character  record  whica  is  maintained 
on  each  active  duty  and  reserve  enlisted  member  of  the  Navy. 
Such  items  as  social  security  number,  pay  entry  base  date, 
schools  attended,  and  test  scores  are  stored  on  each  record. 
2ach  record  is  a condensed  version  of  a member's  personnel 
jacxet. 

To  maice  the  data  easier  to  manipulate,  the  3000 
character  record  file  was  reduced  to  a 150  character  record 
file.  Each  record  contained  the  following  demographic  data: 

Age 

Sex 

Race 

Ethnic  group 


Roma  of  record 
Dependants 
Time  in  rate 

Active  duty/Seserva  duty  indicator 


Length  of  service 

End  oc  active  duty  obligation  date 

Enlistment  teem 

Years  of  education 

Education  certification 

A-School/No  A-Schoo 1 indicator 

Special  test  scores 

SCREEN  score 

Mental  aptitude  test  scores  ( AS V A3  or  3T3) 

The  special  test  scores  measure  aptitude  in  the  areas  of 
sonar,  electronics,  and  radio.  The  SCREEN  score  is  a 
measure  of  a recruit's  chances  of  completing  the  first  two 
years  of  his  enlistment.  rhe  mental  aptitude  tests  are  a 
measure  of  the  member's  overall  intelligence.  The  3asic 
lest  Battery  (3TB)  consists  of  five  exams  in  the  areas  of 
general  intelligence,  numerical  reasoning,  and  mechanical, 
clerical,  and  shop  aptitude.  The  Armed  Services  Vocational 
Aptitude  3attery  (ASVAB)  contains  sixteen  tests  measuring 
specific  areas  of  intelligence  and  aptitudi,  including  tne 
ones  in  the  3T3.  Unfortunately,  tne  A5YA3  had  not  been 
givea  to  a large  enough  sample  of  the  population  to  allow  it 
to  be  used  in  the  analysis. 

Prom  the  data  set,  the  individuals  were  extracted  whose 
length  of  service  was  at  least  one  year,  and  whose  time  in 
rate  was  at  least  six  months.  These  were  considered  the 
members  eligible  to  take  the  E-4  exam.  Since  completion  of 
correspondence  courses  and  practical  factors  is  not 
reflected  in  the  EM3,  it  was  not  possible  to  check  these 
eligibility  requirements.  however,  it  was  assumed  that  if  a 
member  fulfilled  the  time  requirements  and  desired  to  taxe 
the  exam,  he  would  nave  his  other  requirements  completed. 

The  variables  of  the  data  were  originally  in  three 
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scales;  binary,  nominal,  and  interval.  Nominal  data  are 
data  in  which  numbers  ace  used  as  labels  or  "names”,  e.g., 
31ac!t=1,  Chicano=2,  rfhite^S.  Binary  data  were  a special 
form  of  the  nominal  where  only  two  responses  are  available. 
Interval  data  include  all  continuous  variables  where 
differences  between  scores  are  meaningful.  For  example, 
psycnoiog ica  1 test  data  ace  treated  as  interval  data.  All 
of  the  nominal  data  were  converted  to  oinary  values  in  order 
to  meet  tne  assumptions  of  the  analysis.  For  instance,  the 
educational  certif ication  was  transformed  from  tne 
responses:  no  degree,  GED,  high  school  diploma.  Bachelor's 
degree,  and  postgraduate  degree,  to  nigh  school  diploma  or 
no  high  scnool  diploma.  The  GED  and  college  degree  holders 
were  grouped  with  the  high  school  diploma  holders. 


Some  variables  were  not  used  in  parts  or  all  of  tne 
analysis  oecause  it  was  found  there  were  not  enough  members 
in  the  sample  with  values  for  these  variables.  For  example, 
sex  could  not  be  used  in  analyzing  the  3oilar  Technician 
rating  oecause  there  were  no  female  memoers  in  the  sample 
population.  Likewise,  the  AS V A3  scores  could  not  be  used 
oecause  not  enough  of  the  sample  population  had  taicen  this 
test.  Some  of  the  other  determining  factors  ware  not  used 
oecause  it  was  decided  at  the  outset  that  the  variable  could 
not  oe  used  in  policy  decisions.  Borne  of  record  is  an 
example  of  tnis  type  of  variable.  All  variables  used  in  tne 
analysis  with  a description  of  their  types  and  ranges  are 
listed  in  Appendix  A. 


10 


III.  DEFINITION  OF  GROUPS 


Due  to  the  non-availability  of  data  listing  all  test 
takers  and  test  passers  for  the  August  1977  cycle,  a 
comparison  of  the  August  1977  EMR  and  the  August  1978  2MR 
Kas  necessary  to  determine  who  passed  the  exam.  The  2-4 
exam  is  given  only  twice  a year  and  all  test  passers  from  a 
given  cycle  who  are  advanced  no  3-4  are  advanced  from  three 
to  nine  months  after  the  cycle.  Therefore,  if  a member  was 
eligible  to  taice  the  exam  according  to  the  August  1977  file 
and  was  promoted  between  December  1977  and  hay  1973,  he  was 
classified  as  a test  passer.  If  he  was  eligible  to  take  tne 
exam  in  August  1977  but  he  was  not  promoted  during  the 
advancement  period,  he  was  classified  as  a non-test  passer 
regardless  of  whether  or  not  he  toon  the  exam.  This  is 
illustrated  in  the  Figure  1.  Finally,  if  he  was  eligible  to 
take  the  exam  in  August  1977  and  was  not  in  the  August  1973 
fi’e,  he  was  lasted  as  out  of  service  because  it  was  not 
possible  to  determine  if  he  passed  tae  exam  or  not. 
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SHADED  AREA 
INCLUDES  ALL 
NON-TEST  PASSERS 
AS  DEFINED  ABOVE 


Graphical  aepresenti  tion  of  Test  Passers  and  Non-test 

Passers 

Figure  1. 

Once  tae  groups  were  determined,  it  vas  possible  to 
evaluate  the  data  by  either  of  two  methods.  First,  since 
the  out  of  service  member  was  in  tae  Navy  at  the  time  the 
test  was  given,  he  was  actually  a test  passer  or  a non-test 
passer.  The  assumption  would  be  made  that  the  reason  he 
left  the  service  was  that  he  completed  his  time  in  service 
requirements.  Therefore,  time  alone  would  distinguish  tae 
out  of  service  member  from  the  test  passer  and  tae  non-test 
passer.  If  all  factors  involving  time  in  the  Navy  such  as 
length  of  service  and  time  in  rate  are  ignored,  then  the 
test  passer  and  non- test  passer  groups  may  be  looted  on  as  a 
random  sunset  of  the  original  test  passer  / non-test  passer 
/ out  of  service  data.  This  gives  just  two  all  inclusive 
groups  as  they  were  found  on  the  day  of  the  test. 
(Jnf ortunatei y,  one  is  forced  to  ignore  military  time  related 
factors  [variables  13,  14,  and  15,  Appendix  A]  which  might 
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prove  to  be  useful.  Even  so,  this  approach  might  well  be 

<i  -dtf- 

considered  useful  in  evaluating  enlistees  who  as  yet  have  no 
time  in  service. 

Ihe  second  method  alLows  the  inclusion  of  military  time 
dependent  factors.  In  this  approach  the  data  are  viewed  at 
the  test  cycle  plus  one  year  position  (August  1978)  . From 
this  time  perspective,  a member  may  fall  into  only  one  of 
the  three  groups;  test  passer,  non-test  passer,  or  out  of 
service.  Since  all  members  are  included  in  this  approach 
and  no  assumptions  of  random  sunsets  are  made,  the  time 
dependant  factors  may  be  used.  Although  classification  of 
out  of  service  personnel  may  at  first  seem  extraneous  to  the 
problem,  it  is  useful  knowledge,  and  may  in  fact  be  as 
important  as  identifying  the  test  passers. 

3oth  of  the  above  methods  of  grouping  nay  be  applied  at 
various  levels  of  the  Davy's  structure.  First  they  nay  be 
used  at  the  all  Navy  level.  Here  ail  S-3's  who  meet  the 
time  requirements  for  E-4  would  be  considered,  regardless  of 
rate.  Next,  the  individual  ratings  may  be  investigated. 
Before  a member  may  take  an  E-4  exam  in  a particular  rating, 
he  must  complete  certain  study  and  practical  factors 
requirements  for  that  rating.  dembers  in  the  process  of 
doing  this  are  referea  to  as  designated  strikers  in  that 
rating  and  are  identified  on  the  EdB  file.  3y  analyzing 
only  the  designated  strikers  in  a specific  rate,  it  may  oe 
possible  to  predict  the  test  passers  for  that  rate. 
Finally,  for  the  sample  available,  certain  ratings  had  too 
few  designated  strikers  to  give  any  conclusive  results.  To 
handle  these  cases,  hierarchical  clustering  methods  could  be 
used  to  group  ratings  according  to  similarity  of  specified 
attributes  of  the  ratings.  Each  group  of  ratings  could  then 
be  subjected  to  analysis. 
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The  data  used  in  tais  analysis  consisted  of  fifteen 
variables  per  meaner.  The  intent  was  to  divide  the  members 
into  groups  according  to  these  variables.  Hence,  the  use  of 
some  fora  of  multivariate  analysis  was  dictated. 

The  intent  of  the  analysis  was  two-fold.  First,  it  was 
necessary  to  find  out  if  a difference  existed  between  tae 
groups  in  terms  of  the  available  data.  Second,  if  a 
difference  did  exist,  was  it  possible  to  obtain  a 
classification  scheme  which  would  correctly  classify  an 
acceptably  high  percentage  of  the  members? 

Discriminant  analysis  is  one  method  of  performing  tae 
desired  analyses.  Discriminant  analysis  is  a multivariate 
statistical  technigue  used  for  constructing  decision  rules 
oy  waicn  data  units  (enlisted  meaoers)  aay  be  assigned  to 
groups  to  which  they  have  the  greatest  resemblance.  These 
decision  rules  are  statistical  functions.  The  independent 
variables  in  tae  functions  are  the  attributes  of  the  member. 
This  analysis  is  valid  for  both  tae  two  group  and  the  three 
group  case. 

There  are  three  basic  assumptions  underlying 
discriminant  analysis  [ 2] . First,  the  groups  must  be 
discrete  and  identifibla.  3oth  grouping  methods  defined  in 
the  last  section  meet  this  reguireaent.  Second,  each 
ooservation  in  each  group  can  be  described  oy  a set  of  m 
variables.  This  condition  is  met  by  use  of  the  fifteen 
determining  factors.  Finally,  all  a variaoles  are  assumed 
to  have  a multivariate  normal  distribution.  All  of  the 
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interval  lata  far  this  study  were  unimodal  and  symmetric. 
The  data  were  judged  to  nave  a close  enough  approximation  to 
a normal  distribution  to  meet  this  assumption.  The  binary 
data,  of  course  did  not.  However,  according  to  Gilbert  |3|, 
binary  data  may  be  used  with  little  loss  of  discriminating 
power.  This  is  because  the  rotations  of  the  axes  which 
occur  in  discriminant  analysis  cause  the  binary  data  to 
appear  to  be  binary  no  longer. 

As  an  example,  consider  a two  space  problem  of  binary 
vs.  interval  data: 


3inary  vs.  Interval  Data 
Figure  2. 

If  the  axis  is  rotated  to  obtain  a discriminant 
function,  tae  data  no  longer  appear  to  be  binary  vs. 
interval,  but  interval  vs.  interval  as  shown  in  the 
following  figure. 


Transformed  Binary  vs.  Interval  Data 
Figure  3. 

Given  that  all  the  basic  assumptions  for  the  use  of 
discriminant  analysis  are  satisified,  the  type  of 
discriminant  function  must  oe  chosen.  If  tae 
variance-covariance  matrices  of  the  groups  are  equal,  a 
linear  function  may  be  used.  If  they  are  not  equal  tae 
better  discrimination  is  achieved  by  a quadratic  function. 
A two-dimensional  illustration  of  tnis  is  shown  in  tne 
following  diagram.  The  inequality  of  the  covariance 
matrices  is  reflected  by  the  elliptical  contours  of  constant 
density. 


Quadratic  vs.  Linear  Discriminating  Functions 

Figure  4. 


The  quadratic  function  above  misclassif ies  only  one  data 
point,  whereas  the  linear  function  misclassif ies  five 
points.  When  given  a choice,  even  if  the 
variance-covariance  matrices  are  equal,  it  is  wise  to  choose 
tne  quadratic  function.  The  quadratic  snould  always  perfora 
at  least  as  well  as  the  linear  function. 

As  was  stated  in  Chapter  III,  soae  ratings  do  not  have  a 
large  enough  saaple  in  any  testing  cycle  to  allow  one  to 
conduct  a meaningful  analysis.  To  resolve  this  problem, 
cluster  analysis  was  used  to  group  tne  ratings. 

There  are  several  methods  of  duster  analysis  availanle. 
The  particular  type  used  in  this  study  is  termed 
hierarchical  clustering.  For  this  method,  the  data 
attributes  of  each  entity  (rating)  are  investigated  to 
determine  which  two  entities  are  the  most  alixe  1.4].  These 
two  entities  are  clustered  together  to  form  a new  entity  and 
an  averaged  set  of  variables  is  computed  for  the  newly 
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foraed  cluster.  This  cycle  is  repeated  until  there  is  one 
final  cluster  containing  all  of  the  original  entities.  The 
results  can  be  represented  as  a tree  with  all  the  original 
units  on  the  left  and  the  single  cluster  on  the  right. 


Hierarraical  Clustering  Tree 
Figure  5. 


The  aajor  advantage  to  the  hierarchical 
aethods  is  that  they  allow  one  to  view  th 
relationships  aaong  tne  data  units.  The  tree  iiag 
the  natural  clusters  which  occur  and  the 
siailarity  at  the  end  of  each  clustering  cycle.  F 
the  analyst  is  free  to  choose  the  optiaal  numner  o 
for  his  purposes. 


cl ustering 
e overall 
raa  shows 
degree  of 
roa  this, 
f clusters 


7.  TECHNICAL  DETAILS  OF  PROCEDURES  USED 


Discriminant  analysis  was  applied  to  the  data  to 
determine  if  a difference  existed  between  the  group  means 
and,  if  so,  to  construct  decision  rules  to  place  members  in 
the  proper  groups. 

The  first  step  in  the  analysis  was  to  determine  if  the 
variance-covariance  matrices  of  the  groups  were  egaal.  The 
results  of  this  dictated  whether  or  not  to  use  a linear  or 
quadratic  discriminating  function.  In  this  analysis,  every 
sample  produced  unequal  variance-covariance  matrices. 
Therefore,  the  quadratic  function  was  used  in  every  case. 

Next,  the  data  from  each  group  were  investigated  to 
determine  if  a difference  existed  between  the  means  of  the 
groups  in  terms  of  the  variables  being  investigated.  The 
derivation  of  the  two  group  case  is  presented  in  the 
following  paragraphs.  The  three  group  case  is  found  in 
Appendix  3. 

For  this  analysis,  the  groups  were  chosen  to  be  of  equal 

size  (N  = N = N | and  the  variance-covariance  matrices  are 
1 2 

assumed  to  be  unequal.  N observations  from  each  of  the  test 

passer  and  non-test  passer  groups  may  be  represented  in 

th 

vector  notation  as  follows,  where  the  a observation  from 
each  group  is  a 1 xm  vector 


x - 
n 


(x 


1 , n 


t 


) 

m,  n 
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where  n = 1, 
observation 


. . . , H and  i 

u and  y 

1 2 


is  the  nuioer  of  variables  in  each 
are  also  vectors  of  length  a. 


The  nail  hypothesis  for  this  section  of  tne  analysis  is  that 
the  means  of  the  groups  in  terms  of  the  the  determining 
factors  are  egual  or 


a : y = u 
0 1 2 


The  standacd  method  in  univariate  statistics  for 
determining  if  the  mean  of  a population  with  unicnown 
variance  is  egual  to  a specific  value  is  the  t-test.  Given 

2 

taat  the  population  has  a S(  y,  a ) distribution,  the  t 
statistic  is  defined  as  follows 

t = (Vif  (x-y)  ) /S 

The  extention  of  the  t statistic  to  aultivariate  analysis  is 
2 

the  T statistic  whacn  is  defined  as 


2 -1 
T = H (x-y)  'S  (x-y) 


Sow  let  y = x -x  and  N = S +S  . y is  defined  as 
i i,  1 i,2  12 


y = 1/N  Z (x  -x  ) = x -x 
i,  1 i, 2 12 
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|U 


and  the  variance-covaria  nee  matrix  is 


s = 1/  (3-1)  £jy.-y)  (y  . -y)  • 


= 1/  (N- 1 ) T (x  -X  -Y  +1  ) (x  -x  - X X ) • 
i , 1 i, 2 1 2 i , 1 i,2  1 2 


2 


The  T statistic  in  terms  of  y is 


2 -1 
r = 3 y • S y 


and  the  nail  hypothesis  as  stated  above  can  be  rewritten  as 


e : u - u = 0 
0 1 2 


to  accomodate  T in  terms  of  y. 


The  ooject  of  the  T test  is  to  find  a confidence  region 
aoout 


and 


u as  shown  below.  The  size  of  the 
2 


confidence  region  is  dependent  on  a 
significance  of  the  test. 


the  level  of 
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Confidence  Region  About  y - y 

1 2 

Figure  6. 

2 

With  a scaling  change,  the  I statistic  is  distributed 

according  to  an  F distrinution  with  a and  H-m-t  degrees  of 
freedom  I 5| . 

2 a 

r * (N-n»  F (a) 

H s-a-  i S-a-1 


Therefore,  if 


2 

T 

a 


fN-a-1) 

pr=7T« 


a 

F (a) 
H-m-  1 


the  hypothesis  that  the  aeans  are  equal  is  rejected. 

Once  the  inequality  of  the  aeans  has  been  estaolished, 
the  decision  rules  can  be  constructed.  Again,  the  two  group 
case  will  be  dealt  with  here  and  the  three  group  case  can  be 
found  in  Appendix  C. 
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Anderson  states  that  in  addition  to  the  basic 
discriminant  function,  the  classification  function  must  taxe 
into  account  the  a priori  probaDility  of  group  membership 
and/or  the  costs  of  misclass if icat ion.  Furthermore,  he 
states  that,  "the  good  classification  scheme  minimizes  the 
bad  effects  of  aisclassif ication"  | 5|  . As  a means  of 
exploring  this,  let  S be  the  misc lassif ication  function 
which  is  to  be  minimized. 


a » p(1  12)  tt  ♦ ? (2  | 1)  * 

ir  is  the  a priori  probability  of  an  observation  being 
h 

drawn  from  group  h,  and  P(g|h)  is  the  probability  of 

classifying  an  observation  as  a member  of  group  g when  it  is 
actually  a member  of  group  h.  The  costs  of 

misclassif ication  could  be  included  but  are  unknown  and  are 
not  dealt  with  in  this  study.  The  quantity  3 is  minimized 
by  the  following  rule:  assign  to  group  one  if 

f (X)  /f  (X)  > i / n 

1 2 _ 2 1 


Otherwise,  assign  to  group  two.  f ( x) 


density  function  of  population  i. 


normal  populations  with  unequal 
matrices,  the  above  equation  becomes 


1/2  1/2  -1 

C(2*>  C 1 exp-1/2t(X-Ui) 

t 72 — V7z~i : — “ 

rU")  L^J  ] exp-1/2  j.  (X-u  ) 


is  tne  probability 
Assuming  multivariate 
variance -co va  ri ance 


( x-u  2 ) 3 


> 


h 


TT 

1 


where  Z is  the  variance.  Taking  natural  logs  of  both  sides 
and  simplifying  yields 
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„ -1  -1  -1 
1/2  lnC^.2^  3-1/2[(X-  • Ei  (X- u ^ ) - ( X-u  ) • E (X-  u )1 


- "2/"t 


3y  rearranging  the  equation,  the  quadratic  decision  rule  is 
formed. 


-1  -1  -1  - 1 -1  -1 
X'U,  ~ E ) X - 2 (u  * E - u ' 1 ) X ♦ u*  E u - u • E u 

1 2 11  22  111222 

-1 

< la  f E • £ 1 - 21nf  t /tt  3 

- 2 1 2 1 


If  the  above  equation  is  true,  the  observation  should  be 
assigned  to  group  1.  If  not,  assign  the  observation  to 
group  2. 

It  is  also  possible  to  determine  the  contribution  that 
each  variable  makes  to  the  discriminant  function.  That  is, 
it  is  possible  to  compute  the  relative  amount  of 
discriminatory  power  that  each  variable  has. 


th 

Let  V oe  an  mxl  vector  of  coefficients  of  the  i 

j 

discriminating  function.  This  vector  is  transformed  into  a 

scaled  vector  V by  multiplying  each  element  of  v by  the 

i i 

square  root  of  the  corresponding  diagonal  element  of  tae 

pooled  within  groups  deviation  sum  of  squares  matrix  w, 
where 


I.  . = (X  . - X .)  (X  . - X .) 

1,1  gm  gi  gin  gi 
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The  scaled  vector  V'  is 


1 


V*  = [V  * 

1 i,  1 


v 

i,2 


,7.  » 1 

i,n  a, a 


The  discrimination  power  of  eaca  variable  can  now  be 
expressed  as 


rr 


1»  D 


The  method  of  hierarchical  clustering  used  in  this  study 
is  due  to  Ward  £6]  . This  method  is  based  on  the  computation 
of  the  Euclidean  distance  between  centroids  of  the  entities. 

th 

For  this  discussion,  let  x be  tae  value  of  the  i 

ijk 

th  th 

variable  of  the  j data  unit  in  the  it  cluster.  The  mean 


ta  th 

of  the  i variable  in  the  Jt  cluster  is 


r = 1/a  x 
iJc  It  ijit 


and  tae  error  sum  of  squares  for  cluster  it  is 


S = (X  - * ) 

It  igk  Lit 


The  total  within  group  error  sun  of  squares  is 


•I*. 
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The  criteria  for  the  Hard  method  is  the  minimization  of 
the  increase  in  the  within  group  error  sum  of  squares. 

dE  = E - E - 2 
P3  t p g 

where  2 and  E are  the  within  group  error  sum  of  squares 
P <3 

for  the  two  entities  which  are  joined  to  form  the  cluster  t. 
3y  simplification 


2 

dE  = m m (x  -x  ) 

P*  ,4 - ip  ^ 

P 3 


This  function  should  force  the  entities  to  group  into  tigat 
clusters.  Also,  this  should  cause  the  distance  between  the 
clusters  to  be  at  a maximum. 
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VI 


* BESULTS  0?  ANALYSIS 


The  discriminant  analysis  discu 
done  with  the  aid  of  computer 
Eisenbeis  and  Avery  [2).  These 
stepwise  procedure  to  guarantee 
function  possible  from  the  given 
investigates  every  possible  comoi 
factors  beginning  with  a subset  of 
at  factors  I2j  • F:r  each  subset  th 
variables  is  chosen.  Shen  the  a 
analyst  is  free  to  choose  the  subset 
for  his  purposes.  In  this  study,  a 
in  every  case. 
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analysis  to  find  which  rat 
clusters  were  formed  using 
Anderberg  1 4 ( . The  v 
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Decause  they  normally  rea 
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two  sample  populations  to  be  analyzed  were  individual 
ratings;  the  3oiler  Technicians  and  the  Machinist's  Mates. 
The  following  diagram  depicts  the  analyses  which  were  done. 
The  reader  may  find  it  helpful  to  refer  to  this  while 
reading  the  sucaeding  material. 


ALL  NAVY 

2 3 

GROUP  GROUP 

SN  CLSTR  L SN  CLSTR  2 SN  CLSTR  4 

2 3 2 3 2 3 

GROUP  GROUP  GROUP  GROUP  GROUP  GROUP 

BOILER  TECH  MACHINIST'S  MATE 

2 3 2 3 

GROUP  GROUP  GROUP  GROUP 


Discriminant  Analyses  Conducted  in  this  study 

Figure  7. 

For  the  all  Navy  sample,  300  members  of  each  group, 
i.e.,  test  passers,  non-test  passers,  and  out  of  service 
personnel  were  randomly  selected  from  the  eligible  pool. 
Pirst,  tne  two  group  case,  test  passars  vs.  non-test  passers 
was  analyzed.  After  checking  the  variance-covariance 
matrices  and  finding  them  to  be  unegual,  a comparison  of  tae 
group  means  was  made.  The  test  for  equality  of  group  means 
produced  an  F statistic  of  3.93  which  rejected  the  null 
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hypothesis  at  tha  0.01  Level  of  significance  (see  Appendix 
0) . The  variables  in  this  sample  which  provided  the 
greatest  power  of  discrimination  were  race  (24.5:4),  high 
school  diploma  (20.3%),  and  dependents  (13.594).  The 
analysis  indicated  Caucasian  members  nad  a higher 
probability  of  passing  the  exam  taen  their  non-Caucasian 
counterparts.  Individuals  having  a high  school  diploma 
and/or  having  dependents  also  were  more  likely  to  pass  the 
exam.  A complete  list  of  the  determining  factors  and  the 
amount  of  discriminating  power  for  each  sample  population  is 
listed  in  Appendix  S. 

Once  the  discriminant  function  was  computed,  tae 
classification  of  each  member  was  done  using  a guadratic 
function  as  described  in  Chapter  V.  me  rate  of  "hits"  or 
correct  classification  for  the  all  Navy  two  group  sample  was 
6 24  (see  Appendix  H)  . This  classification  rate  includes  the 
effects  of  the  a priori  probaoility  of  group  membership  as 
do  all  other  classif ication  rates  in  this  study. 

The  three  group  case  for  the  all  Navy  sample  was  also 
investigated.  Here  the  aguality  of  the  group  means  was 
rejected  at  tha  0.0  1 level  of  significance.  In  this  case 
the  length  of  time  remaining  in  service  (TAOS) , 32.04,  and 
length  of  service,  12.27%,  proved  to  be  the  most  powerful 
determining  factors.  Botn  of  these  variables  discriminated 
best  between  tha  out  of  service  group  and  the  other  two 
groups.  This  is  probably  because  a member  wita  one  year  or 
less  remaining  in  his  obligation  is  not  likely  to  take  the 
exam  if  ha  is  planning  to  leave  the  service.  Decause  the 
rewards  ha  will  reap  are  not  worth  tha  effort  to  prepare  for 
the  exam.  In  this  sample  the  rate  of  nits  was  53.0%,  which 
was  sligatly  better  than  the  two  group  case. 

Clustering  analysis  of  the  seamen  rates  resulted  in  four 
clusters  (see  Appendix  F)  . The  ratings  tended  to  group  by 
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joo  skills;  the  electronics  rates  in  one  cluster,  the 
administrative  rates  in  another,  etc.  Three  of  the  clusters 
were  subjected  to  discriminant  analysis.  The  first  cluster 
investigated  was  cluster  four  in  Appendix  ?.  The  test  for 
equality  of  group  means  generated  an  F statistic  of  1.99 
which  was  significant  at  the  2.6*  level.  The  largest 
discriminating  variables  in  this  sample  were  years  of 
education  (21.77*),  race  (15.0*),  the  numerical  reasoning 
section  of  the  BTB  (13.7*),  and  the  clerical  aptitude 
section  of  the  3TB  (10.  hoi).  For  this  sample  57*  of  tae 
observations  were  correctly  classified. 

Cluster  four  was  also  subjected  to  the  three  group 
discriminant  analysis.  Here  the  F statistic  was  5.25  and 

-30 

was  significant  at  the  10  level.  As  in  the  all  Savy 

group,  length  of  time  remaining  in  service  was  the  largest 
discriminator  (29S)  , but  was  followed  by  the  BTB  test  for 
shop  aptitude  (12.09*)  and  time  in  service  (3.24*).  The 
three  group  sample  was  correctly  classified  59.0*  of  the 
time. 


The  Seamen  rating  cluster  two,  two  group  sample,  was 
analyzed  next.  The  test  for  equality  of  group  means  was 
significant  at  the  12*  level,  indicating  that  the  means  of 
the  groups  were  close  to  one  another.  Since  the  means  were 
so  close,  the  discriminating  variables  and  tne 
classification  rates  should  be  viewed  with  care. 

The  three  group  case  of  the  Seamen  cluster  two  did  not 
test  as  having  equal  means  and  classified  63.0*  of  the 
observations  correctly.  Even  so,  only  43*  of  the  non-test 
passers  ware  correctly  classified. 

The  Seamen  cluster  one  consisted  mostly  of  electronics 
ratings.  The  two  group  test  of  equality  of  group  means 
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produced  an  F statistic  which  was  significant  at  the  45* 
level.  This  is  hard  to  accept  until  one  realizes  the  entry 
requirements  for  these  rates  are  very  specific.  Most 
entrants  have  a high  school  diploma,  at  least  twelve  years 
of  school,  have  gone  to  A-school,  etc.  In  short,  the 
cluster  is  s c homogeneous  in  terms  of  the  available  data 
that  it  was  not  possible  to  discriminate  between  the  groups. 

The  three  group  case  faired  somewhat  better  as  with  the 
other  clusters.  The  test  for  equality  of  the  means  was 
rejected  and  the  classification  was  correct  69. OS  of  the 
time.  However,  it  must  be  noted  that  only  33S  of  the  test 
passers  were  correctly  classified. 

The  test  of  group  means  for  the  Boiler  Technicians  two 
group  sample  resulted  in  an  P statistic  which  was 
significant  at  94*.  Because  of  this,  the  classification 
results  should  be  used  with  care.  Sven  though  the  three 
group  analysis  showed  the  means  were  not  equal,  the 
percentage  of  correct  classifications  was  only  50.0*.  Again 
it  was  this  good  only  because  of  tne  addition  of  the  out  of 
service  group.  The  distinction  between  the  test  passers  and 
non-test  passers  was  not  very  good,  with  only  23.0*  of  tne 
non-test  passers  being  correctly  identified. 

The  Machinist's  Mata  rating  two  group  case  produced 
surprising  results  when  the  test  of  the  equality  of  tne 
group  means  produced  an  P statistic  which  was  significant  at 
0.36*.  Considering  the  results  of  tne  other  samples,  one 
would  have  expected  the  significance  to  be  muca  larger.  The 
most  powerful  determining  factors  for  this  case  were  the  BT3 
tests  for  general  intelligence  (29.91*)  and  mechanical 
aptitude  (26.44*)  . Of  the  total  sample,  66*  was  classified 
correctl y. 

The  three  group  test  produced  an  P statistic  which  was 
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significant  at  10  . This  saapla  showed  tiae  caiainxng  in 

the  Navy  to  be  the  most  powerful  determining  factor  at 
18.05*.  Of  the  total  saaple,  66*  was  correctly  identified 
using  tne  decision  rules. 

?rom  the  above  results  it  can  be  seen  that  the  leans  in 
most  of  the  two  group  samples  are  too  close  together  to 
permit  any  distinct  discrimination.  Ihis  is  also  true  for 
the  three  group  samples  because,  although  the  out  of  service 
group  is  distinct  enougn  to  cause  the  equality  of  group 
aeans  test  to  fail,  the  test  passer  and  non- test  passer 
groups  remain  as  indistinct  as  oefore.  The  fact  that  tne 
all  Navy  saapla  tasted  as  not  having  equal  aeans,  while  all 
other  samples  did,  appears  to  be  due  to  the  variability  of 
the  data  in  the  all  Navy  case. 


VI r.  CONCLUSIONS 


Discriminant  analyses  and  cluster  analyses  were  used  in 
this  study  as  tools  to  attempt  to  deteraine  if  there  was  a 
difference  between  E-4  test  passers  and  non-test  passers  in 
teras  of  tae  available  biographic  data. 

The  study  has  shown  with  a high  level  of  certainty  that 
the  available  data  do  not  differentiate  sharply  between  the 
groups.  It  would  appear  that  the  entrance  reguireaents  for 
the  Navy  or  the  specific  rating  create  a homogeneous  group 
of  personnel  in  terms  of  these  variables.  Since  there 
appears  to  be  nothing  in  the  general  biographic  background 
of  a member  to  distinguish  between  the  groups,  other 
possible  explanitory  variables  might  be  inspected.  For 
instance,  where  was  a member's  last  duty  station,  what  type 
of  leaders  did  he  have,  was  he  training  in  a rating  he 
liked? 

Although  the  discriminating  variables  were  not  found  in 
the  study,  the  use  of  the  E-4  exam  as  a measure  of 
effectiveness  of  a member's  utility  to  tne  Navy  still  merits 
study.  Sharper  discrimination  might  result  if  data 
reflecting  actual  test  takers  is  used. 


APPENDIX  A 


LIST  OF  DETERMINING  FACTORS 


FACTOR 


1122 


RANGE 


1.  Dependents 


Binary 


0-No  dependents 
1 - Dependents 


2.  Age 


Inter  val 


19-33 


3.  Race 


3inary 


0- Non -Caucasian 
1 - Caucasian 


4.  Sex 


Binary 


0 - Female 

1 - Male 


5.  A-3chool 


Binary 


0 - No  A-School 

1 - A-School 


Interval  9-16 

Binary  0 - No  high 

scaool  diploma 
1 - high  school 
diploma 


BTB 

(Gd  Q • 

Intel. ) 

Interval 

31 

-74 

BT3 

(Nam  • 

Reason. ) 

Interval 

32 

-69 

6.  Years  of  education 

7.  Education 
cert  if ication 


34 


10.  BT8  (.lech.  Apt.) 

Interval 

37-70 

11.  STB  (Clr . Apt.) 

Interval 

38-76 

12.  3TB  (Shop  Apt.  ) 

I nterval 

37-74 

13.  liae  in  rata 

I nterval 

6-59  aos 

14.  Tiae  reaainiag 

I nterval 

0-43  aos 

15.  Length  of  service 

Interval 

12-72  aos 

35 


appendix  b 


K GROUP  TEST  FOR  EQUALITY  OP  GROUP  SEANS 


H : 
0 


u 

1 


u 

2 


u 

k 


Thxs  iaplied  u 

i 


u = 0 for  1 i i i j < k 

j 


Let 

“ (2)  = 


uWl)  = u - u =0 
1 2 

u/2*u/2-  u = 0 
1 2 3 


1) 


y/(k-1)  ♦ u/  (k-1 ) ♦ 

1 


Now  a 

0 


lT«(i)  = 0 
r-t 


Shich  is  the  fora  of  Anderson's  test  of  the  hypothesis 


a 

o 


*Vi 


o 


Let  y 

i 


3 x ♦ 3 x ♦ 

1 i,  1 2 i,2 


♦ 3 x 
k i ,k 


I5|. 
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APPENDIX  D 

rESTS  OF  EQUALITY  OF  GBOU?  HEANS 


EQUALITY  OF 

V AEI ANCE 

EQUALITY  OF 

SEANS 

F STATISTIC 

LEVEL 

? STATISTIC 

LEVEL 

All  Navy 

(2  Group) 

1.78 

0.0 

3.  83 

0.0 

All  Navy 

(3  Group) 

2.34 

0.0 

16.69 

0.0 

SN  Clstr  1 

(2  Group) 

1.38 

0.0 

.99 

45.  29 

SM  Clstr  1 

(3  Group) 

2.07 

0.0 

a.  15 

0.0 

SN  Clstr  2 

(2  Group) 

1.4  1 

.87 

1 . 49 

12. 36 

SN  Clstr  2 

(3  Group) 

1.75 

0.0 

3.  14 

0.  0 

SN  Clstr  4 

(2  Group) 

1.98 

0.0 

1.99 

2. 56 

SN  Clstr  4 

(3  Group) 

2.06 

0.0 

6. 25 

0.0 

BT 

(2  Group) 

2.08 

0.0 

. 4 1 

94. 70 
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i 


aa 

(3  Group) 


2.  10 


0.0 


0.0 


0.0 


4.46 


2.61 


7.43 


0.  0 
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APPENDIX  E 


POSES  OF  DETERMINING  PACTOSS 


ALL  NAVY 


2 GROUP 


Sace 

24.50* 

Edu.  Cert. 

20.30* 

Dependents 

13.  59 

Age 

10.55* 

3TB  (Clr.  Apt.) 

8.47* 

BTB  (Nua.  season.) 

5.58* 

BTB  (Mech.  Apt.) 

5.28* 

A-School 

4.59* 

BTB  (Gen  Intel.) 

4.59* 

Years  of  edu. 

1.57* 

BTB  (Shop  Apt.) 

0.48* 

Sex 

0.35* 

3 GBQUP 


Tiae  remaining 

32.03* 

Time  in  service 

12.27* 

Dependent  s 

7.84* 

Sex 

7.45* 

Edu.  Cert. 

6.99* 

rise  in  rate 

6.46* 

BTB  (Shop  Apt.) 

6.04* 

BTB  (Mech.  Apt.) 

4.85* 

A-School 

3.98* 

3T3  (Clr.  Apt.) 

3.04* 

Sace 

2.43* 

Age 

2.05* 

Years  of  edu. 

1.67* 

BTB  (Gen.  Intel.) 

1.49* 

BTB  (Nua.  Seas.) 

1.35* 
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SEAMEN  GHOO?  1 


2 GfiOOP 


BT3  (Shop  Apt.) 

29.43* 

3TB  (clr.  Apt) 

17.  88* 

3T3  (Num.  Season.) 

14.32* 

3X3  (Gen.  Intel.) 

8.55* 

3T3  (Mech.  Apt.) 

6.53* 

Dependen  ts 

6.07* 

A- School 

5.  14* 

Age 

4.23* 

Bace 

3.  17* 

2da.  cent. 

3.  14* 

ITS  Of  9QU. 

1.33* 

3 GBOUP 


Time  remaining 

35.72* 

3T3  (Gen.  Intel.) 

10.95* 

BTB  (Shop  Apt.) 

8.82* 

Age 

8.19* 

Tima  in  rate 

6.21* 

STB  (Clr.  Apt.) 

6.  17* 

3T3  (Mach.  Apt.) 

4.89* 

Time  in  service 

3.54* 

A-Scnool 

3.46* 

Dependents 

3.45* 

Sdu.  cert. 

3.3d* 

3TB  (Bam.  Season.) 

2.18* 

Irs  of  edu. 

1.93* 

Bace 

1.03* 
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sea  mss  aaocn?  2 


2 GROCJP 


A-3chool 

17.30* 

BTB  ( aech.  Apt.) 

14.98* 

3TB  (Clr . Apt.) 

13.35* 

3T3  (Shop  Apt.) 

11.26* 

Yrs  of  edu. 

9.62* 

Edu.  cert. 

9.  17* 

Sex 

7.  14* 

BTB  (Gen.  Intel.) 

4.35* 

BTB  (Mum.  Season.) 

4.73* 

Age 

2.81* 

Dependents 

2.40* 

Race 

1.30* 

3 3B0UP 


Time  remaining 

37.68* 

A-School 

6.76* 

Time  in  rate 

6.69* 

Dependents 

6.24* 

3T3  (Mum.  Season.) 

5.76* 

Ed u.  cert. 

5.54* 

Yr  of  edu. 

5.4  2* 

Time  in  service 

5.22* 

Race 

4.52* 

3T3  (Shop  Apt.) 

3.99* 

3T3  (3ech  Apt.) 

3.57* 

BTB  (Clr.  Apt.) 

3.49* 

BTB  (Gen.  Intel.) 

2.60* 

Sex 

1.39* 

Age 

1.05* 
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* 


1! 


SEAMEN  GHOUP  4 


2 GHOUP  3 GHOUP 


Years  of  edu. 

21.77% 

Tine  reaaining 

29.27% 

Hace 

15.04* 

3TB  (Shop  Apt.) 

12.09% 

3TB  (Nub.  .Season.) 

13.70% 

Tima  in  service 

3.24* 

3TB  (Clr.  Apt.) 

10.46* 

Dependents 

7.94* 

3TB  (Shop  Apt.) 

10.31* 

Edu.  cert. 

7.  14% 

Sex 

6.  13* 

3T3  (Gen.  Intel.) 

6.67% 

Edu.  cert. 

6.01* 

A-School 

4.39* 

Dependents 

4.52% 

Years  of  edu. 

4.15* 

BT3  (Gen.  Intel.) 

3.41* 

BTB  (Nua.  Season.) 

4.06* 

A-School 

3.  32* 

Sex 

3.97% 

Age 

2.39% 

Hace 

3.94* 

3T3  (Seen.  Apt.) 

2.39* 

Tiae  in  rate 

2.58* 

BTB  (Mech  Apt.) 

1.96* 

3TB  (Clr.  Apt.) 

1.79* 

Age 

1.74* 
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r 


30ILSR  TECHNI Cl  AS 


2 GROUP 

i GROUP 

3 TB  (Gen.  Intel.) 

18.  1 1* 

Tiae  remaining 

18.05* 

3TB  (Nua.  Reason.) 

16.964 

rears  of  edu. 

15.39* 

8TB  (Clr.  Apt.) 

15.  62* 

3TB  (Shop  Apt.) 

12.84* 

rears  of  eda. 

12.324 

Time  in  service 

10.73* 

Eda.  cert. 

10.30% 

Age 

10.46* 

A-School 

9.43* 

BT3  (!lech.  Apt.) 

10.25* 

3T3  (Shop  Apt.) 

6.  99* 

Edu.  cert. 

7.46* 

Dependents 

4.39* 

A-School 

3.23* 

3T3  (Sech.  Apt.) 

4.05* 

3T3  (Gen.  Intel.) 

3.13* 

Age 

1.42* 

Tiae  in  rate 

2.92* 

Race 

0.35* 

3T3  (Rum.  Reason.) 

2.19* 

9 

Dependents 

1.39* 

Race 

1.30* 

3TB  (Clr.  Apt.) 

0.58* 
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A 


rr 


machinist's  sate 


2 GROUP 

3 GROUP 

BTB  (Gen.  Intel.) 

29.91* 

Time  remaining 

29.56* 

BTB  (Hech.  Apt.) 

26.44* 

BTB  (Hech.  Apt.) 

15.55* 

Age 

10.01* 

Edu.  Cert. 

11.17* 

BTB  (Hum.  Reason.) 

9.6  8* 

Time  in  service 

8.30* 

BTB  (Clr.  Apt.) 

7.97* 

BTB  (Gan.  Intel.) 

6.91* 

Dependents 

5.15* 

Time  in  rate 

6.26* 

Years  of  edu. 

5.03* 

BTB  (Clr.  Apt.) 

5.29* 

A-5caool 

2.4  1* 

Age 

4.65* 

Race 

1.70* 

Years  of  edu. 

3.88* 

Edu.  cert. 

1.53* 

Race 

3.05* 

BTB  (Shop  Apt.) 

0.  12* 

BTB  (Shop  Apt.) 

1.30* 

BTB  (Mum.  Reason.) 

1.65* 

A-School 

1.62* 

Dependents 

0.25* 

46 


APPENDIX  F 

RESULTS  OP  CLUSTERING  OF  SEAMEN  RATINGS 


CLUSTER  1 

CLUSTER  2 

STS  - 

Sonar  Tech  (S) 

SK  - 

Storekeeper 

STG  - 

Sonar  Tech  (G) 

PC  - 

Postal  Clark 

FTG  - 

Pira  Cntl  Tech  (gun) 

IN  - 

Yeoaan 

FTM  - 

Pira  Cntl  lech  (aissle) 

DK  - 

Dispursing  Clerk 

FTB  - 

Pira  Cntl  Tech  (FBM ) 

MS  - 

Mess  Specialist 

STM  - 

Electronics  Tech  (conn) 

sa  - 

Ships  serviceaan 

ETR  - 

Electronics  Tech  (radar) 

LI  - 

Lithographer 

EM  - 

Elec  Warfare  Tach 

CTA  - 

Coaa  Tech  (adain) 

CTH  - 

Coaa  Tach  (saint.) 

TM  - 

Tor pedoaan 1 3 Mate 

CTI  - 

Coaa  Tach  (intarp. ) 

3M  - 

Boatswain's  Mate 

D5  - 

Data  Systeas  Tech 

MT  - 

Missle  Tech 

CLUSTER  3 


CLUSTER  4 


GAG  - 

Gunner  s Sate 

(gun) 

CTT  - 

Coaa  Tech  (tech) 

GMM  - 

Gunner's  Sate 

( aissles) 

CTO  - 

Coaa  Tech  (coaa) 

GST  - 

Gunner's  Bate 

(tech) 

CT3  - 

Coaa  Tech  (collection) 

an  - 

aineaan 

DP  - 

Data  Processing  Tech 

oa  - 

Qpticalaaa 

IS  - 

Intelligence  Spec 

a a - 

aaster  at  Arms 

PH  - 

Personnelaan 

ia  - 

Instr uaentaan 

aa  - 

Radioaan 

DM  - 

Draftsman 

QM  - 

Quartermaster 

OS  - 

Operations  Spec 

OT  - 

Ocean  Sys  Spec 

MU  - 

Musician 

JO  - 

Journalist 

LN  - 

Legalaan 
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APPENDIX  G 

CLASSIFICATION  RESULTS 


i AaPLE  % £0  ERECTLY  CLASSIFIED 

test  non  test  out  of 


TOTAL 

PASSERS 

PASSERS 

SERVICE 

All  Navy 

(2  Group) 

52% 

80% 

45% 

— 

All  Navy 

(3  Group) 

63% 

77% 

39% 

7 1% 

SN  clstr  1 

(2  Group) 

74% 

32% 

38% 

“ *“ 

5N  Clstr  1 

(3  Group) 

69% 

33% 

79% 

72% 

SN  Clstr  2 

(2  Group) 

63% 

77% 

49% 

• "" 

SN  Clstr  2 

(3  Group) 

6 3% 

71% 

43% 

76% 

SN  Clstr  4 

(2  Group) 

67% 

81  % 

50% 

• 

SN  Clstr  4 

{3  Group) 

69% 

80% 

4 2% 

77% 

3T 

60% 

86% 

34% 

-- 

(2  Group) 
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8T  57* 

(3  Group) 

an  66* 

(2  Group) 

aa  66* 

(3  Group) 


30* 


714 

524 


23* 

604 

584 


604 


774 
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APPENDIX  H 


CLUSTERING  tree  for  jeamen  rating 


SEAMEN  i 

IT;'*  MAM  6 

10 

MO. 

SC  !*•  TF.'I-  S 

a 

fl  = £ OtL  *ECH  G 

14 

s-r«a  teck  g 

7 

fi'c  cntl  tfch  m 

15 

6lEC  WAPFaOE  T £CM 

6 

mi S5L3  tech 

17 

CC*'*  " 

27 

E LcZ  r8CM  CS  TECH  N 
ELECTS  CM  KS  TECH  R 

19 

20 

H :E  C*irt.  r gCF  9 

16 

data  StS  TECH 

21 

CO  mm  tech  I 

30 

Qis  ie P*S  G 

13 

MI  (s=  ma  -4 

13 

CL)V:=r  *S  mate  M 

11 

GUMMc  n 1 $ MA'c  T 

12 

IM  $rS(J«E  N7MAN 

22 

«l SrfS  AT  ASMS 

2 

C P ~I  CALHiN 

23 

COMM  TECH  T 

25 

= ES3  C*'VELH  W 

33 

CCVM  * ECrt  9 

29 

CiTi  s‘,T'*FSS  TECH 

34 

IN~ ELL  'SEC 

38 

comm  t=-CH  c 

28 

SIG'ML  MM 

4 

P ACIC'MN 

i<* 

j Sir TCMAM 

43 

C’.J  irr  pbuaSTER 

3 

CFESA'IJNS  ?PEC 

5 

CVE4N  STS  TECH 

9 

v i 51  Cl  4'i 

44 

JwCR'-.ALHAN 

40 

LEGAL'’  AM 

32 

>~C9cKEE=£9 

35 

postal  c lepk 

41 

C'v<  TECH  4 

26 

y grvt  j'i 

31 

oi  S3  a*  si  mo  clerk 

36 

9°  iT?W»  !>'•  $ MATE 

1 

LI  'HCGPAPHEF 

42 

HESS  SPEC 

37 

SHi?*E  <E=VlCcM&M 

39 

’■CPPEOCRAK’  S “ATE 

10 

IT  EH  *M*£ 

10 

MO. 

4TINGS  3-78 

X 2 3 4 5 6 


12  3*56 


7 8 


9 10  11  12  13  1*  15 


1 

I 

t 

I 

I 

I 

f 

I 

I 

I 

1 NOTE: 


-I 

1 

I 

I 

I 

I 

I 

I — 

I 

I 

I 

I 

I 

I 

-I 


FINAL  2 BRANCHES 
JOIN  AT  CLASS  25 


7 8 


9 10  11  12  13  14 


15 
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St  amen  9ATIGS  3-78 

This  -uv  OEOKTS  THE  PORTION  OF  THE  -REE  GENERATED  BETWEEN  stage  1 ANO  STAGE, 

"he  cpite=ic:  values  a=  e segmented  into  the  fallowing  classes. 

CLASS  LCwE  R SQ'JND  UPPER  BCUNC 


1 

0 .199/266 7E- 

03 

0.  19  7 92  12  9F- 

01 

l 

0.197-.2129  E— 0 l 

0 .5933952  IE- 

01 

3 

Q. 29339521E- 

•Cl 

C. 5692691 8E-01 

4 

0 . 569269  18  E- 

01 

C. 76515285E- 

01 

5 

0.78c19235E- 

■01 

0. 93111629 E- 
C.ll  7702  9 7E 

0 1 

o 

0.  96  U 1629E- 

01 

00 

J 

0.1 1 77CJ9  7E 

00 

0.12  729632E 

oc 

a 

0. 1 372 9 6326 

0O 

7.l56dd856E 

00 

9 

0 . 1 56  t 6 6ooc 

00 

C.  1 56981  C1E 

cc 

10 

0 .176981016 

00 

0. 19607235E 

oc 

1 1 

C. 19607335E 

00 

0.2 1566570E 

00 

l z 

0 -21566  5 70!: 

UJ 

0.  2 2 5 25  609E 

oc 

13 

0 .2  3525  304  E 

OJ 

0. 2598503 9E 

00 

I A 

0.25965C39E 

00 

C.  2 79992  73  E 

00 

IS 

0 .27999^736 

00 

C.2S9035C3E 

oc 

16 

0.29903509 E 

00 

J.31362792E 

0 0 

l l 

0. 312o2792f 

00 

C. 33321 977E 

oc 

1 3 

0.33321977E 

70 

0 . J5291211E 

oc 

I 9 

C. 352  31 21 1 S 

00 

0 . 3 72  9 0 9 9 6E 

00 

20 

C .2  72-.0996E 

00 

0.  3 51S963CE 

oc 

21 

0 ,3C 1 9G690  c 

Go 

0.91 15891 5E 

00 

22 

0.91  1569  15  E 

0 7 

7. 9311819 9E 

00 

2J 

0 .931181996 

00 

0.  95077389E 

00 

29 

0. 9 5 07  7 J8  E 

00 

0 .9  7 3366 1 8 E 

00 

25 

C.97C36616E 

00 

C. 96995972E 

00 
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